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Abstract 



We address a problem connected to the unfolding semantics of functional programming languages: 
give a useful characterization of those infinite A-terms that are A| e t re c-expressible in the sense that 
they arise as infinite unfoldings of terms in A| etrec , the A-calculus with letrec. We provide two 
characterizations, using concepts we introduce for infinite A-terms: regularity, strong regularity, 
and binding-capturing chains. It turns out that Aietrec-expressible infinite A-terms form a proper 
subclass of the regular infinite A-terms. In this paper we establish these characterizations only 
for expressibility in A M , the A-calculus with explicit /i-recursion. We show that for all infinite 
A-terms T the following are equivalent: (i): T is A M -expressible; (ii): T is strongly regular; (hi): T is 
regular, and it only has finite binding-capturing chains. 

We define regularity and strong regularity for infinite A-terms as two different generalizations 
of regularity for infinite first-order terms: as the existence of only finitely many subterms that are 
defined as the reducts of two rewrite systems for decomposing A-terms. These rewrite systems act 
on infinite A-terms furnished with a bracketed prefix of abstractions for collecting decomposed 
A-abstractions and keeping the terms closed under decomposition. They differ in which vacuous 
abstractions in the prefix are removed. 

This report accompanies the article [9], and mainly differs from that by providing the proof 
of the characterization of A^-expressibility with binding-capturing chains. 
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\q 1 Introduction 

iL A syntactical core of functional programming languages is formed by A| e trec, the A-calculus 

with letrec, which can also be viewed as an abstract functional language. Formally, A| e trec is 
the extension of the A-calculus by adding the construct letrec for expressing recursion as well 
^>- as explicit substitution. In a slightly enriched form (of e.g. Haskell's Core language) it is used 

as an intermediate language for the compilation of functional programs, and as such it is the 
basis for optimizing program transformations. A calculus that in some respects is weaker 
than Aietrec is A^, the A-calculus with the binding construct /x for /it-recursion. Terms in A M 
can be interpreted directly as terms in A| etre c (expressions fif.M(f) as letrec / = M(f) in /), 
but translations in the other direction are more complicated, and have weaker properties. 

For analyzing the execution behavior of functional programs, and for constructing program 
transformations, expressions in A| e t re c or in A^ are frequently viewed as finite representations 
of their unfolding semantics: the infinite A-term that is obtained by completely unfolding all 
occurring recursive definitions, the letrec- or /^-bindings, in the expression. 

In order to provide a theoretical foundation for such practical tasks, we aim to understand 
how infinite A-terms look like that are expressible in A| etrec or in A p in the sense that they 
are infinite unfoldings of expressions from the respective calculus. In particular, we want to 
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obtain useful characterizations of these classes of infinite A-terms. Quite clearly, any such 
infinite A-term must exhibit an, in some sense, repetitive structure that reflects the cyclic 
dependencies present in the finite description. This is because these dependencies are only 
'rolled out', and so are preserved, by a typically infinite, stepwise unfolding process. 

For infinite terms over a first-order signature there is a well-known concept of repetitive 
structure, namely regularity. An infinite term is called 'regular' if it has only a finite number 
of different subterms. Such infinite terms correspond to trees over ranked alphabets that 
are regular [5] . Like regular trees also regular terms can be expressed finitely by systems of 
recursion equations [5], by 'rational expressions' [5) Def. 4.5.3] which correspond to /z-terms 
(see e.g. [5]), or by terms using letrec-bindings. In this context finite expressions denote 
infinite terms either via a mathematical definition (a fixed-point construction, or induction 
on paths) or as the limit of a rewrite sequence consisting of unfolding steps. Regularity of 
infinite terms coincides, furthermore, with expressibility by finite terms enriched with either 
of the binding constructs /i or letrec. It is namely well-known that both representations are 
equally expressive with respect to denoting infinite terms, because a representation using 
letrec's can also be transformed into one using //'s while preserving the infinite unfolding. 

For infinite A-terms, however, the situation is different: A definition of regularity is less 
clear due to the presence of variable binding. And there are infinite A-terms that are regular 
in an intuitive sense, yet apparently are not A| etre c- or A M -expressible. For example, the syntax 
trees of the infinite A-terms T in Fig. [T]and U in Fig. [5] both exhibit a regular structure. But 
while T clearly is A M - and A| etre c-expressible (by /if.Xxy.fyx and letrec / = Xxy.fyx in /, 
respectively), this seems not to be the case for U: the A- bindings in U are infinitely entangled, 
which suggests that it cannot be the result of just an unfolding process. Therefore it appears 
that the intuitive notion of regularity is too weak for capturing the properties of A M - and of 
Aietrec-expressibility. We note that actually these two properties coincide, because between 
A M -terms and A| e trec-terms similar transformations are possible as between representations 
with fi and with letrec of infinite first-order terms (but this will not be proved here). 

It is therefore desirable to obtain a precise, and conceptually satisfying, definition of 
regularity for infinite A-terms that formalizes the intuitive notion, and that makes it possible 
to prove that A M -/A| e trec-expressible infinite A-terms form only a proper subclass of the regular 
ones. Furthermore the question arises of whether the property of A M -/A| etre c-expressibility 
can be captured by a stronger concept of regularity that is still natural in some sense. 

We tackle both desiderata at the same time, and provide solutions, but treat only the case 
of A M -expressibility here. We introduce two concepts of regularity for infinite A-terms. For 
this, we devise two closely related rewrite systems (infinitary Combinatory Reduction Systems) 
that allow to 'observe' infinite A-terms by subjecting them to primitive decomposition steps 
and thereby obtaining 'generated subterms'. Then regular, and strongly regular infinite 
A-terms are defined as those that give rise to only a finite number of generated subterms 
in the respective decomposition system. We establish the inclusion of the class of strongly 
regular in the class of regular infinite A-terms, and the fact that this is a proper inclusion (by 
recognizing that the A-term U in Fig. p] is regular, but not strongly regular). As our main 
result we show that an infinite A-term is A^-expressible (that is, expressible by a term in A M ) 
if and only if it is strongly regular. Here we say that a term M in A^ expresses an infinite 
A-term V if V is the infinite unfolding of M. An infinite unfolding is unique if it exists, and 
can be obtained as the limit of an infinite rewrite sequence of unfolding steps. 

This expressibility theorem is a special case of a result we reported in [SJ , which states 
that strong regularity coincides with A| etrec -expressibility That more general result settles a 
conjecture by Blom in [3] Sect. 1.2.4]. Its proof is closely connected to the proof of the result 
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on A^-expressibility we give here, which exhibits and highlights all the same features, but 
lacks the complexity that is inherent to the formal treatment of unfolding for terms in A| etre c • 

Additionally we give a result that explains the relationship between regularity and strong 
regularity by means of the concept of 'binding-capturing chain' : a regular infinite A-terms is 
strongly regular if and only if it does not contain an infinite binding-capturing chain. 

This report is associated with the article [5] in the proceedings of RTA 2013. It extends 
that article by providing more details on Section HI and it contains some changes in the 
exposition of the proof in Section ^1 Furthermore it contains some changes of notatiorr] as 
well as modifications and corrections of details. Also closely related is the report [S] about 
the more general case of expressibility in A| etr ec- 

Overview. In Section [2] we introduce rewriting systems (infinitary CRSs) for decomposing 
A-terms into their generated subterms. By means of these systems we define regularity 
and strong regularity for infinite A-terms. In Section [3] we provide sound and complete 
proof systems for these notions, that play a vital role for the proof of the main result in 
Section [5] later. In Section [4] we develop the notion of binding-capturing chain in infinite 
A-terms, and show that strong regularity amounts to regularity plus the absence of infinite 
binding-capturing chains. In Section [5] we establish the correspondence between strong 
regularity and A M -expressibility for infinite A-terms. In the final Section[6]we place the results 
presented here in the context of our investigations about sharing in cyclic A-terms. 

2 Regular and strongly regular infinite A-terms 

In this section we motivate the introduction of higher-order versions of regularity, and 
subsequently introduce the concepts of regularity and strong regularity for infinite A-terms. 
For higher-order infinite terms such as infinite A-terms, regularity has been used with as 
meaning the existence of a first-order syntax tree with named variables that is regular (e.g. 
in J2JQ])- For example, the infinite A-terms T and U from Figures II] and [2] are regular in 



For example, the symbol used for the version of the A-calculus with abstraction prefixes defined in 
Section [2] has been changed from (A)°° in 9 to (A)A°° here. Similarly, the symbol used for a proof 
system denned in Section pj] that is sound and complete for such terms has been changed from (A)°° in 
[2] to (A)A~ here. 
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this sense. However, such a definition of regularity has the drawback that it depends on a 
first-order representation (as syntax trees with named abstractions and variables) that is not 
invariant under a-conversion, the renaming of bound variables. Note that the syntax trees of 
T and U have renaming variants that contain infinitely many variables, and that for this 
reason are not regular as first-order trees. It is therefore desirable to obtain a definition of 
regularity that uses the condition for the first-order case but adapts the notion of subterm to 
A-terms, and that pertains to a formulation of infinite A-terms as higher-order terms. 

Viable notions of subterm for A-terms in a higher-order formalization require a stipulation 
on how to treat variable binding when stepping from a A-abstraction Xz.V into its body V. 
For this purpose we enrich the syntax of A-terms with a bracketed prefix of abstractions 
(similar to a proof system for weak /^-equality in [SJ Fig. 12]), and consider (Xz)V as a 
'generated subterm' of Xz.V, obtained by a A-abstraction decomposition applied to ()Xz.V, 
where () is the empty prefix. An expression (Aaq . . .x n )T represents a partially decomposed 
A-term: the body T typically contains free occurrences of variables that in the original A-term 
were bound by A-abstractions but have since been split off by decomposition steps. The role 
of such abstractions has then been taken over by abstractions in the prefix (Axi . . . x n ). In 
this way expressions with abstraction prefixes are kept closed under decomposition steps. 

We formulate infinite A-terms and their prefixed variants as terms in iCRSs (infinitary 
Combinatory Reduction Systems) for which we draw on the literature. By iCRS-terms we 
mean a-equivalence classes of iCRS-preterms that are defined by metric completion from 
finite CRS-terms |12j . For denoting and manipulating infinite terms we use customary 
notation for finite terms. In order to simplify our exposition we restrict to closed terms, but 
at one stage (a proof system in Section |5| we allow constants in our terms. 

Note that we do not formalize /^-reduction since we are only concerned with a static 
analysis of infinite A-terms and later with finite expressions that express them via unfolding. 

► Definition 1 (iCRS-representation of A°°). The CRS-signature for the A-calculus A and the 
infinitary A-calculus A°° consists of the set Ea = {app, abs} where app is a binary and abs a 
unary function symbol. By Ter(A°°) we denote the set of infinite closed iCRS-terms over T,\ 
with the restriction that CRS-abstraction can only occur as an argument of an abs-symbol. 
Note that we restrict attention to closed terms, and that here and below we subsume finite 
A-terms among the infinite ones. 

► Example 2. The A-term Xxy.yx in CRS-notation is abs([x]abs([y]app(y,cc))). 

► Definition 3 (iCRS-representation of (A)A°°). The CRS-signature £(a)a f° r (^)^°°? the 

version of A°° with bracketed abstractions, extends £> by unary function symbols of ar- 
bitrary arity: £(a)a = £a u {pre„ |neN}. Prefixed A-terms pre„([2q] . . . [a; n ]T) will in- 
formally be denoted by (Xxi . . .x n )T, abbreviated as (Xx)T, or ()T in case of an empty 
prefix. By Ter((A)A°°) we denote the set of closed iCRS-terms over £(a)a °f * ne f° rm 
pre„([xi] . . . [a;„]T) for some n e N and some term T over the signature Sa with possible 
free occurrences of X\, . . . , x n , and the restriction that a CRS-abstraction can only occur as 
an argument of an abs-symbol. 

► Example 4. The term pre 1 ([x]abs([y]app(y,x))) in Ter((A)A°°) can be written, in in- 
formal notation, as the prefixed A-term (Xx) Xy.y x. 

On these prefixed A-terms, we define two rewrite strategies -^rog and ->-rcg + that deconstruct 
infinite A-terms by steps that decompose applications and A-abstractions, and take place just 
below the marked abstractions. They differ with respect to which vacuous prefix bindings 
they remove: while -> reg -steps drop such bindings always before steps over applications 
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and A-abstractions, -^ reg +-steps remove vacuous bindings only if they occur at the end of 
the abstraction prefix. These rewrite strategies will define respective notions of 'generated 
subterm', and will give rise to two concepts of regularity: a A-term is called regular/strongly 
regular if its set of -^ lcg -reachable/^>- rcg +-reachable generated subterms is finite. 

► Definition 5 (decomposing (A)A°°-terms with rewrite strategies -*- reg and -*- re g+)- We consider 
the following CRS-rules over S/^ in informal notationrl 



(q 5 ) 



(Xxi...x n )T Ti ->■ (Xxi ...x n )Ti (ie {0,1}) 

(Axi ...x n )Xx n+ i.T ->■ (Xxi ...x n+ i)T 

(Xxi . . . x n+ i)To -*■ (Xxi . . . x u )Tq (if binding Xx n+ i is vacuous) 

(Xxi . . . x h+ i)Tq -*■ (Xxi . . . Xi-\Xi + i . . . x n+ \)To (if bind. Xxi is vacuous) 



We call an occurrence o of a binding like a A-abstraction Xz or a CRS-abstraction [z] in a 
term V vacuous if V does not contain a variable occurrence of z that is bound by o. 

The iCRS with these rules induces an ARS (abstract rewriting system) A on infinite 
terms over S(a)a- By (A) we denote the sub- ARS of A with its set of objects restricted to 
Ter((A)A°°). Note that Ter((A)A°°) is closed under steps in (A). By -*-@ , -*® i: -*-\, -*s, 
-►del we denote the rewrite relations induced by (jl)-steps with respect to rules g®°, g® 1 , g x , 
g s , g dcl . We define Reg {Reg*) as the sub-ARS of (A) that arises from dropping steps that 
are: 

i due to g s (p dcl ), so that the prefix can be shortened only by p dol -steps (£> s -steps). 

i due to rules other than g del (g s ) but whose source is also a source of a g dcl -step (g> s -step). 
Reg {Reg*) is g dcl -eager (g s -eager) in the sense that on each path g del -steps (p s -steps) occur 
as soon as possible. We denote by ^ rcg (^ re g+) the rewrite strategy induced by Reg {Reg*)r\ 

► Example 6. Using the recursive equation T = Xxy.Tyx as a description for the infinite 
A-term T in Fig.fTl we find that decomposition by -» rog +-steps proceeds as follows, repetitively: 

(Xxv)T V {Xxy)T {Xx)T ()T ■•■ 
()T {Xx)Xy.Tyx {Xxy)Tyx {Xxy)y 

{Xxy)x {Xx)x 

(in a tree that branches to the right) . Note that removal steps for vacuous bindings take place 
only at the end of the prefix. See Fig. n\ right for the reduction graph of ()T with displayed 
sorts of decomposition steps. Although ->s-steps also are ^del-steps, this decomposition is 
not also one according to ->rcg , because e.g. the step {Xxy)Ty -►@ 1 {Xxy)y is not g -eager. 

The rules g are related to the de Bruijn notation of A-terms. Consider Xx.{Xy.xx) x 
which in de Bruijn notation is A.(A.l 1) and when using Peano numerals A. (A. 5(0) 5'(0)) 0. 
Now if the symbols S are allowed to appear 'shared' and occur further up in the term as in 
A. (A. 5(0 0)) 0, then this term structure corresponds to the decomposition with -> rog +. 

To understand the difference between ->rog and -^ rcg + , consider the notions of scope and 
scope + , illustrated in Figures [l] and [5] The scope of an abstraction is the smallest connected 
portion of a syntax tree that contains the abstraction itself as well as all of its bound variable 
occurrences. And scope + s extend scopes minimally so that the resulting areas appear properly 
nested. For a precise definition we refer to [SI Sect. 4]. As can be seen in Figures [l] and [2] 
applications of p del {g s ) coincide with the positions where scopes (scope + s) are closed. 



E.g. explicit form of scheme (g s ): pre„ +1 ([ii . . .x„+i]Z(xi, ... ,x n )) -*■ pre n ([xi . . .x n ]Z(xi,. ..,!„)). 
We use 'rewrite strategy' for a relation on terms, and not for a sub-ARS of a CRS-induced ARS |15| . 
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Figure 2 The regular infinite A-term U that is not strongly regular, and not A M -expressible. 



► Definition 7 (regular/strongly regular A-terms, generated subterms). Let T eTer(X x> ). We 
define the sets ST{T) and ST (T) of generated subterms of T w#/l respect to -^ IC g and -^• rC g+: 

ST(T) := {U e Ter((A)A~) | ()T ^ rcg U} ST + (T) := {U e Ter((A)A~) | ()T - rcg+ J7} 

We say that T is regular (strongly regular) if T has only finitely many generated subterms 
with respect to -^rcg (respectively, with respect to ->-rcg+)- 



► Example 8. From the -^ r0 g+-decomposition in Example[6]and Fig.lllof the infinite A-term T 

in Fig.nlit follows that ST + (T) consists of 9 generated subterms. Hence T is strongly regular. 

The situation is different for the infinite A-term U in Fig.[2j When represented as the 



term Xx.R(x) together with the CRS-rulc R(X) 

(Xxy)R(y) (Xxyz)R(z)y 



► rcg +-decomposition is: 

(Xxyzu)R(u) z 
(Xxy)y 

sg + -generated subterms, 



\y.R(y)X, its - 

(Xxyz) R(z) 
()U (Xx)R(x) (Xxy)R(y)x v~»'"-™> v~»~y"v«v» (Xxyx)y 

(Xxy)x (Xx)x 

Since here the prefixes grow unboundedly, U has infinitely many -> r 
and hence U is not strongly regular. But its ^• 1 . cg -decomposition exhibits again a repetition 
as can be seen from the reduction graph in Fig. [2] on the right. Note that a vacuous binding 
from within a prefix is removed. ()U has 6 only different -> rog -reducts. Hence U is regular. 
For infinite A-terms like (Axi.Xi) (Ax1.Ax2.X2) (Ax1.Ax2.Ax3.x3) . . . that do not have any 
regular pseudoterm syntax-trees, both -^■ rcg +-decomposition and -^ reg -decomposition yield 
infinitely many generated subterms, and hence they are neither regular nor strongly regular. 



For a better understanding of the precise relationship between 



and 



> reg+ , 



and 



eventually of the two concepts of generated subterm and of regularity, we gather a number 
of basic properties of these rewrite strategies and their constituents. 

► Proposition 9. The restrictions of the rewrite relations from Def. pjlto Ter((A)A°°), the 
set of objects of Reg and Reg , have the following properties: 

(i) 
(ii) 
(iii) 



►del is confluent, and terminating. 

>S £ -►del- Furthermore, ->s is deterministic, hence confluent, and terminating, 
►del one-step commutes with ->■>, -»-@ , -*-@ 1 , and one-step sub-commutes with 
'•del postpones over -^-\, -+®, , -+@, and ->s- Formulated symbolically, this means: 
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(iv) Normal forms of -»reg and ->>eg+ are of the form (Ace) x, and (Axi . . . x n ) x n , respectively, 
(v) -»reg an d ->-rog+ are finitely branching, and, on finite terms, terminating. 

Proof. These properties, including those concerning commutation of steps, are easy to verify 
by analyzing the behavior of the rewrite rules in Reg on terms of Ter((A)A°°). ■« 

► Proposition 10. (i) Let (Xx)T be a term in Ter((A)A°°) with \x\ = n e N. The number 

of terms (Xy)U in Ter((A)A~) with (Xy)U -» de} (Ax) T and \y\ = n + k e N is ( n+ n k ). 
(ii) Let A £ Ter((A)A°°) be a finite set, and k e N. Then also the set of terms in Ter((A)A°°) 
that are the form (Ay) U with \y\ < k and that have a -»-dei-reduct in A is finite. 

We state a lemma about a close connection between ->reg- and -^ reg + -rewrite sequences. 

► Lemma 11. (i) On Ter((\)\°°) it holds: <^dci ■ ^rc g + £ -^el ' ~^rc g ' *-del; where -+' del 

denotes many-step ->-dei -reduction to -»-del -normal form. As a consequence of this and 
of ^dci ' ^*rcg - "^reg; every finite or infinite rewrite sequence in Ter((A)A°°); 

T : (A5o)To ->- r eg+ (Xxi)Ti ^ rcg + • . . ^rcg+ (Xxk)Tk -»-reg+ • • ■ 

projects over a sequence ir : (Xxq)Tq -»dci (Xx' q )Tq to a rewrite sequence of the form: 

t : (Xx' ) T -» reg (Xx[)Ti -» reg . . . -» mg (Xx' k )T k -^ rog . . . 

in the sense that (Xxk)T k -»dei (A5' fe )Tfe for all k € N Zess or equal to the length of r. 
(zz,) On Ter((A)A°°) it holds: -»dci • "^rcg £ ^s ' _> rc^+ ' ""'del ■ -Owe to i/iz's and -*■§ ■ -^" cf r+ £ 
-» reg + , every rewrite sequence r : (Xx' Q )To -^ re g (A^)Ti -^ rcg . . . -^ re g (Ax' fc )Tfc -^ rcg . . . 
in Ter((A)A°°) lifts over a sequence ir : (A5o)2o -»dd (ASq)To to a -» rog + -rewrite 
sequence of the form: f : (Xxq)Tq -» reg + (At?i)Ti -» re g+ • • ■ -»rcg+ (A5fc)Tfc -» re g+ • ■ • 
in the sense that (Xxk)Tk -»dci (Xx' k )Tk for all k € N Zess or egwaZ to i/ie length of t. 

Proof. The inclusion properties in (m) and El can be shown by easy arguments with diagrams 
using the commutation properties in Proposition^] dm|, as well as M and (pl| from there. ■* 

Now we are able to establish that strong regularity implies regularity for infinite A-terms. 

► Proposition 12. Every strongly regular infinite A-term is also regular. Finite A-terms are 
both regular and strongly regular. 

Proof. Let T be a strongly regular infinite A-term. Therefore ST (T) is finite. Since every 
-> rcg -rewrite-sequence from ()T lifts to a -» rC g+ -rewrite-sequence from ()T over -»dcrcom- 
pression due to Lemma |ll[ my, every term in ST(T) is the -»dci~compression of a term in 
ST + (T). Then it follows by Proposition[i~0J @, that also ST(T) is finite. Hence T is also 
regular. 

Let T be a finite A-term. Due to to Proposition]^ Q, Kdnig's Lemma can be applied 
to the reduction graph of ()T with respect to -» re g+ to yield that T has only finitely many 
generated subterms with respect to -» rog + ■ Hence T is strongly regular. ■* 

3 Proving regularity and strong regularity 

As a preparation for the proof of the main expressibility result in Section [5] we introduce, 
in this section, proof systems for regularity and strong regularity of infinite A-terms that 
formulate these notions in terms of derivability: the systems Reg 00 and Reg + '°° with 
typically infinite derivations, and the systems Reg, Reg , and Reg for provability by finite 
derivations. A completed derivation of ()U in Reg°° (in Reg + '°°) corresponds to the 'tree 
unfolding' of the -^ rcg -reduction graph (the -^• rcg + -reduction graph) of ()U, which is a tree 
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Figure 3 The proof system Reg + for strongly regular A-terms. In the variant system Reg^ of 
Reg + , instances of (FIX) are subject to the additional side-condition: for all (Xy) U on threads in 
Do from open marked assumptions ((Xx)T) u downwards it holds that \y\ > \x\. The systems (A) A°° 
and Reg + '°° do not contain the rule FIX. Derivations in Reg + , Regp, and Reg°° must be (S)-eager. 

that describes all -> rog -(resp. -*- reg + -)rewrite sequences from ()U. Closed derivations of ()U 
in Reg (in Reg , or Reg ) correspond to finite unfoldings of the -> rcg -reduction graph (the 
-►reg 4 -reduction graph) into a graph with only vertical sharing. 

We start by introducing proof systems for well-formed prefixed terms, that is, terms from 
the set Ter((A)A~)). 

► Definition 13 (proof systems (A)A°°, (A) A°° for well-formed A°°-terms). The proof systems 
defined here act on CRS-terms over signature £(a)a as formulas, and are Hilbert-style systems 
for finite or infinite prooftrees (of depth < to). The system (A) A°° has the axioms (0) and 
the rules (&), (A), and (S) in Fig.pl The system (A)A°° arises from (A) + A°° by replacing the 
axioms (0) and the rule (S) with the axioms (0) and the rule (del) in Fig. El respectively. 

A finite or infinite derivation T in (A)A°° (in (A) A°°) is called closed if all terms in leafs 
of T are axioms. Derivability of a term (Xx) T in (A) A°° (in (A) A°°), denoted symbolically 
by i-/ A \ A ~ (Xx)T (resp. by i-( A \+ A ~ (Xx)T), means the existence of a closed derivation with 
conclusion (Xx)T. 

We say that a proof system S is sound (complete) for a property P of infinite A-terms if 
h 5 ()T implies P(T) (if P(T) implies \- s ()T) for all infinite A-terms T e Ter(X°°). 

► Proposition 14. (A)A°° and (A) A°° are sound and complete for all infinite A-terms. What 
is more, these systems are also sound and complete for all prefixed infinite A-terms among 
all terms over signature £(a)a : for all terms W 6 Ter(E( A ) A ) it holds that i-( A ) A ~ W if and 
only if W e Ter((A)A°°) (and hence W is of the form (Xu) W). 

Proof. For completeness of these systems note that every prefixed term (Ay) U e Ter((A)A°°) 
with U not a variable is the conclusion of an instance of a rule in these systems. < 

Next we define proof systems for proving regularity and strong regularity of infinite 
A-terms by means of typically infinite derivations. 

► Definition 15 (proof systems Reg°°, Reg + '°°). The proof systems Reg°° and Reg + '°° have 
the same axioms and rules as (A)A°° and (A) A°°, respectively, but they restrict the notion of 
derivability. A derivation T> in (A)A°° (in (A) A°°) is called admissible in Reg°° (in Reg + '°°) 
if it contains only finitely many different terms, and if it is (del) -eager ((S) -eager), that is, 
if no conclusion of an instance of (@) or (A) in V is the source of a ^doi-step (a ->-s-step). 
Derivability in Reg°° (in Reg + '°°) means the existence of a closed admissible derivation. 

For Reg°° and Reg + '°° we easily obtain the following soundness and completeness results. 

► Proposition 16. (i) Reg°° is sound and complete for regularity of infinite A-terms. 
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(Axi . . . Xi-iXi + i . . . X n ) 1 (if the binding 



(Xy)y (Axi...a: n )T 



Xxi is vacuous) 



Figure 4 The proof system Reg for regular A-terms arises from Reg + through replacing the 
rule (S) by the rule (del), and the axiom scheme (0) by the more restricted version here. The 
systems (A)A°° and Reg°° do not contain the rule (FIX). Derivations in Reg and Reg°° must be 
(del)-eager. 

(ii) Reg '°° is sound and complete for strong regularity of infinite A-terms. 

Proof. We argue only for ml, since (|I| can be seen analogously. Every (S)-eager derivation T 
in (A) A°° with conclusion ()T assembles the maximal -> rcg +-rewrite sequences from ()T in 
the following sense: the steps of every such rewrite sequence correspond to the steps through 
T along a thread from the conclusion upwards. Therefore if T is an admissible derivation in 
Reg + '°°, and hence contains only finitely many terms, then ST (T) is finite. Since every 
term ()T in Ter((\)\) has a (S)-eager derivation in (A) A°°, the converse holds as well. « 

Finally we introduce proof systems for proving regularity and strong regularity of infinite 
A-terms by means of finite derivations. Derivations in these systems are able to detect the 
cyclic structure of a regular or strongly regular A-term, and correspondingly, cyclicity in -*- r eg- 
and -^ rcg + -rewrite sequences that decompose the term. These proof systems are reminiscent 
of coinductively motivated proof systems such as the ones for recursive type equality and 
subtyping by Brandt and Henglein [3] (proof-theoretic connections with more traditional 
proof systems have been studied in [7]). 

► Definition 17 (proof systems Reg, Reg + , and Reg^). The natural-deduction style proof 
system Reg + has the axioms and rules in Fig.pl Its variant Reg^ demands an additional 
side-condition on instances of the rule (FIX) as described there. The system Reg arises from 
Reg + by dropping the rule (S), and restricting the axioms to the axioms (0) in Fig. Pi] 

A derivation in one of these systems is called closed if it does not contain any undischarged 
marker assumptions (discharging assumptions is indicated by assigning the appertaining 
assumption markers to instances of FIX, see Fig.K3j). Derivability in Reg (in Reg + or in 
Regp) means the existence of a closed, (del)-eager ((S)-eager), finite derivation. 

The proposition below explains that the side-condition '\T>q\ > V on subderivations of 
FIX-instances guarantees a 'guardedness' property for threads in derivations in these systems. 

► Proposition 18. Let V be a derivation in Reg, Reg + , or RegQ. Then for every instance i 
of the rule (FIX) in T> it holds: every thread from i upwards to a marked assumption that is 
discharged at i passes at least one instance of a rule (A) or (@). 

Proof. Let T> be a derivation in Reg, as the argument is analogous for Reg + and Regg. Let 

i be an instance of (FIX) in T>, and ir a thread from the conclusion (Ay) U of c to a marked 
assumption ((Xy) U) that is discharged at t. Then due to the side-condition on the topmost 
instance k of (FIX) passed on ir there is at least one instance of a rule (A), (@), or (del) 
passed on ir above n. We are done unless that is an instance of (del). But then there must 
also be an instance of (A) on n, since (del) decreases the prefix length, only (A) increases it, 
and the prefix lengths in the formula at the start and at the end of n are the same. •« 
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► Example 19. (i) The following are two derivations in Reg + of different efficiency of the 
infinite A-term T from Fig. [I] when represented by the recursive equation T = Xxy.Tyx: 

mT y ((Xx)Xy.Tyx) 1 

({)I) S ()T 

{Xx)T 5 u S 



S -77 — ^0 ,, . (Xx)T 



(Xxy)T (Xxy)y ^ (Xx)x \ '—- S 

■S (Xxy)T (Xxy)y (Xx)x 



(Xxy)Ty (Xxy)x tt >„ @ A \ S 
@ (Xxy)Ty (Xxy)x 



(Xxy)Tyx 



X (Xxy)Ty 



x 



(Xx)Xy.Tyx \ »' " A 

A (Xx)Xy.Tyx 



() Xxy.Tyx -, — ^ — — FIX,/ 

U y y F]XI (Xx)Xy.Tyx 

()r A 

Note that only the left derivation is one in Rego, because the right one contains a term 
with shorter prefix than the discharged assumption on a thread to the instance of FIX. 
(ii) The infinite A-term from Fig. [2] denoted by the term ()Xx.R(x) and generated by the 
CRS-rule R(X) -*■ Xx.R(x) X is derivable in Reg by the closed derivation on the left, 
but it is not derivable in Reg + : 





(Xxyzuv)R(v)u (Xxyz)z 

* q (Xxyzu)Xv.R(v)u (Xxyzu)z 

((Xy)R(y)Y ^ (Xx)x ^ {Xxyzu) R(u) z ~ _ ® (Xxyfy ° 

( Xx v) R (y) ( XX V) X @ {Xxyz)Xu.R(u)z X (Xxyz)y S 

(Xxy)R(y)x ^ (Xxyz)R(z)y ® (Xx)x ° 

(Xx)Xy.R(y)x_ (Xxy) Xz.R(z)y A (A^ ^ 






()Aa;.E(a;) 



A 



The latter follows from the infinite prooftree on the right, the result of a bottom-up proof 
search in Reg + , which is a derivation in (A) A°° but not in Reg + '°°, since, as it does 
not contain repetitions, the rule FIX cannot be used to cut off repetitive subderivations. 

Finally, we can link derivability in Reg and Reg + to regularity and strong regularity. 

► Theorem 20. (i) Reg is sound and complete for regularity of infinite X-terms. 
(ii) Reg + and Regg are sound and complete for strong regularity of infinite X-terms. 



Proof. For (jib , in view of Proposition 16 (m), it suffices to be able to transform closed, 
admissible derivations in Reg°° into closed derivations in Reg, and vice versa. Every closed 
derivation V in Reg can be unfolded by a stepwise, typically infinite process into a closed 
derivation in (A)A°°: in every step the subderivation of a bottommost instance i of FIX 
is transferred to above each of the marked assumptions that are discharged at l, and the 
original instance of FIX is removed. If this process is infinite, then due to Proposition[T8| 
it always eventually increases the size of the part of the derivation below the bottommost 
occurrences of FIX. Hence in the limit it produces a closed, (del)-eager derivation in (A)A°° 
that contains only finitely many terms (only those in T>) , and thus is admissible in Reg°°. 
Conversely, every admissible, closed derivation T in Reg°° can be 'folded' into a finite closed 
derivation in Reg by introducing FIX-instances to cut off the derivation above the upper 
occurrence of a repetition. This yields a finite derivation since due to admissibility of T in 
Reg°° every sufficiently long thread contains a repetition, and then Konig's Lemma can be 
applied. 
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For Reg + in (ml) it can be argued analogously, using Proposition |l6| (pu), and unfolding/- 
folding between closed derivations in Reg + and closed, admissible derivations in Reg + '°°. 
Soundness of Rego follows from soundness of Reg + . For completeness of Rego, note that 
every closed, admissible derivation T in Reg + '°° can be 'folded' into a closed derivation of 
Rego by using a stricter version of repetition of terms: distinct occurrences of a term (Xy) U 
on a thread of a prooftree form such a repetition only if all formulas in between have an 
equally long or longer abstraction prefix. Since T is admissible, on every infinite thread 9 of 
T there must occur such a stricter form of repetition, namely of a term with the shortest 
abstraction prefix among the terms that occur infinitely often on 0. ■* 

4 Binding-Capturing Chains 

In this section we develop a characterization of strongly regular infinite A-terms through a 
property of their term structure, concerning 'binding-capturing chains' on positions of the 
term. While not needed for obtaining the result concerning A| e trec-expressibility in Section [5] 
we think that this characterization is of independent interest. 

Binding-capturing chains originate from the notion of 'gripping' due to Mellies [T3], and 
from techniques concerning the notion of 'holding' of redexes developed by van Oostrom [13] . 
In |6J they have been used to study a-conversion-avoiding //-unfolding. 

Technically, binding-capturing chains are alternations of two kinds of links between 
positions of variable occurrences and A-abstractions (called binders below) in a A-term: 
'binding links' from a A-abstraction downward to the variable occurrences it binds, and 
'capturing links' from a variable occurrence upward to A-abstractions that do not bind it, 
but are situated on the upward path to its binding A-abstraction. We formalize these links 
by binding and capturing relations, which are then used to define binding-capturing chains. 

► Definition 21 (binding, capturing). Let T € Ter(A°°). On the set Pos(T) of positions of T 
(for positions in iCRS-terms, see |12j ) we define two binary relations: the binding relation °— , 
and the capturing relation -+. Since these relations are specific to the term T, they could 
be denoted by °— y, and -*y, respectively. However, we will generally avoid this subscript 
notation, assuming that the underlying infinite A-term will always be clear from the context. 
For defining °— and -* for T, let p, q e Pos(T). 

p °— q (in words: a binder (A-abstraction) at p binds a variable occurrence at q) holds if p 
is a binder position, and q a variable position in T, and the binder at position p binds the 
variable occurrence at position q. 

q -*■ p (in words: a variable occurrence at q is captured by a binder at p), and conversely 
p *- q (the binder at p captures a variable occurrence at q), hold if q is a variable position and 
p < q a binder position in T, and there is no binder position q in T with p < q and q Q <>— q. 

► Definition 22 (binding-capturing chain). Let T e Ter(\°°). A finite or infinite sequence 
{po, q%,pi, qiiVi) • • ■) in {0, 1} is called a binding- capturing chain in T if p$, qi,Pi,q2,P2, ■ ■ • 6 
Pos(T), and if these positions are linked alternatingly via binding and capturing: p\ &- (72 ~* 
Pi °— 93 -+ P3 ^~ ■ ■ •) starting with a binding and ending with a capturing. The length of such 
a binding-capturing chain is the number of 'is captured by' links. 

See Figs.[T]and|2]for illustrations of binding-capturing chains in terms we have encountered. 
Note that binding-capturing chains occur whenever scopes overlap, or in other words when 
nesting scope + s occur. Every binding-capturing chain is fully contained within a scope" 1 ". 

Now we introduce a position-annotated variant Reg pos of Reg in order to relate binding- 
capturing chains to rewrite sequences in Reg . The idea is that if a A-term T has a generated 
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subterm (Xyi . . . y n ) U in Reg + , then (Xyi . . . y n ) q pi p U is a generated subterm in Reg pos , 
where p\,...,p n are the positions in T from which the bindings Xy\ . . . y n in the abstraction 
prefix descend, and q is the position in T of the body U of the generated subterm. 

► Definition 23 (iCRS-representation of (A) pos A°°, terms in Ter((A) pos A°°)). The CRS-sig- 
nature for (A) A°°, the A-calculus with position-annotated abstraction prefixes is given by: 

E (A) pos A = S A u {pre q {pu pn) | n e N, Pl , . . . ,p n , q e {0, 1}*} 

where all of the function symbols pre? . are unary. Terms that are of the specific 

form pre? )([^i] ■ ■ ■ [^n]? 1 ) will be denoted in informal notation as (Xx± . . ■ x n ) q T, 

which can be abbreviatee to (Ax)?T, or to ()T in case of an empty prefix. By Ter((A) pos A°°) 
we denote the set of closed iCRS-terms over S(^) oa >, of the form pre? ^([^l] ■ • • [^njT) 

for some n e N, pi,. . . ,p n ,q e {0,1}* and some term abs([x 1 ] . . . abs([x n ]T)) over the 
signature Y,\ with the restriction that a CRS-abstraction in T can only occur as an argument 
of a function symbol abs. 

► Definition 24 (position-annotated variant Reg*). On Ter((A)A°°) we consider the following 
rewrite rules in informal notation: 



y^pos ) 

(Ppos) 
\cpos ) 



{\x 1 ...x n )l up J T 1 ^{Xx 1 ...x n )'f u>pn T i (for each i e {0,1}) 

(Axi . . . x n )l u Pn Xy.T -+ (Xxi . . . x n y)f° \.. iPn>q T Q 

(Xxi . . .x n+1 ) q piPn+i T Q ^ (Xxi . . .x n ) q pu Pn T Q (if bind. Xx n+ i is vacuous) 



The change of the term-body position in a A-decomposition step is motivated by the underlying 
CRS-formalization of terms in (A)A°°: when a subexpression abs([y]To) of a term in 
Ter((A) S A°°) that represents a A-abstraction starts at position q, its binding is declared 
at position qO, and its body Tq starts at position qOO. 

By Reg pos we denote the abstract rewriting systems induced, similar to the definition of 
Reg + in Def. pearlier, by the rules above on position-annotated terms in Ter((A)A°°). 

Also analogously to Def. 5 by -^ rcg + we denote the g pos -ea,gei rewrite strategy for Reg pos . 

The lemma below gathers basic properties of the rewrite relation -> r0 g+ on position-anno- 
tated prefixed A-terms, and statements about the form of possible -» reg +-rewrite sequences. 

► Lemma 25. (i) If ()\\T -» reg + (Xx)lU , then there is n e N such that x = {x\ ...x n ), 

p = (pi,...,p n ), pi,...,p n ,qt Pos(T), and pi < p 2 < . . . < p n < q- 
(ii) If (Xyi...y„) q qi qn Ui «- reg + (Xxi...x n )l° lt ... tPn U -» reg + (Azi . . . z m )^ _ Tm U 2 holds 

for some q e {0,1}*, then (Xyi . . . y„) q qu ^Ui = {Xz x . . . z m ) q ru Tm U 2 follows, and 

hence also n = m, and qi = n, . . . , q n = T m . 
(Hi) If t : (Axi . . . Xn) q U -» reg + (Xxi . . . x n ') q , , U' is a rewrite sequence in Reg pos , 

and no = max {i \ 1 < i < n, pi = p[ , . . . , pi = p[} (< min {n, n'}) then t is of the form: 

t : (Axi . . . x n ) q pu pn U -» reg+ (Xxi . . . *n )p°,... lPno V -^ rcg+ (Xxi . . . x n >) q p , ip ,^ U' 

for some q$ such that q < qo < p' n +i < • • • < p' n i ■ 

(iv) For every rewrite sequence r : (Xxi . . . x n ) q puPn U ^ rcg + (Xxi . . . x n+m+ i) q pi ^ Pn+m+l V 
is of the form: 

T : (Axi . . . I n ) pi pn U -^>'reg + \Xx\ . . . X n+m ) Pl p fl+m XXn+m+l-'O 



p„ +m+ i00 



► reg + {\xi ■■■x n + m +i) Pli ..^ Pn+m+1 vc 



(I 
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{>^xi...x n y) q pipnr y 



{\ Xl ...x n y)l u pnq T 
(Xx!...x n ) q pi pn Xy.T 



(\x 1 ...x n - 1 ) q pi jPre _T (ifthebindi 



(\x x ...x n ) q pu p T 



mg 

Xx n is vacuous) 



(\x)l u T (Xx^T, 

{Xx^TqTx 



Figure 5 The proof system (A) A" 



-»rcg+ (Xx\ . . . %n+m+l) pil ... t p n+m+1 y 

where x n+m occurs free in Xx n+m+ i. Vq, and hence also in Vq. 
Proof. Each of the statements (TO), (pi), and |iv| can be shown by induction on the length of 



the 



'rcgi 



-rewrite sequence in the assumption, distinguishing the cases of the rule applications 



in the last step. For the part of statement di), that a rewrite sequence ()1)T -» rog + (Xx)'lU 
implies p\, . . . ,p n , q e Pos(T), it suffices to show that q e Pos(T), because, due to the form 
of the rules in Reg pos , a position is added to the list in the subscript of the abstraction prefix 
(by an application of the rule g pos ) only if it has already been encountered as the position 
in the subscript of an abstraction prefix. This remaining statement can be established as 
follows: every -> rog + -rewrite sequence from (),,T explores T (or a pre-term representation of 
T) from the root position downwards, thereby keeping track of the current position in the 
superscript q of the abstraction prefix. 

Statement (HiJ can be shown similarly by induction on the sum (or the minimum) of the 
lengths of the two -^ rcg +-rewrite sequence in the assumption. ■* 

The proposition below formulates the statement that -> rcg +-rewrite sequences on terms in 
Ter((A)A°°) are related to -> rcg +-rewrite sequences on position-annotated terms via lifting 
(adding annotations) and projecting (dropping annotations). 

► Proposition 26. (i) Lifting: Every rewrite sequence r : (Xxq)Tq -> rC g+ (ASi)Ti -> r cg+ 
• ■ • ^rcg+ {Xx n )T n in Reg + can be lifted, by adding given qo € N* and po € ^* with \po\ = 
\xq\, and appropriate further position annotations q\, . . . ,q n 6 N* and p\, . . . ,p n € N*, 
to the terms of r, yielding a rewrite sequence r pos : (Axo)|°T ^- r cg+ (ASi)^Ti ^- rcg + 
. . . -+ r eg+ (Xx n )l n n T n in Reg pos . 
(ii) Projection: The result of dropping the position annotations in the prefix in a rewrite 
sequence in Reg pos is a rewrite sequence in Reg + . 

Proof. Statements (i) and (ii) can be shown by straightforward induction on the length of 
rewrite sequences in Reg and in Reg , respectively. •« 



rog + , and the position-change recorded in steps of 
> rcg + in Reg + pos have the property to be trees. As 



As a consequence of (S)-eagerness of 
-> rcg + , reduction graphs with respect to 
such they can be captured directly by prooftrees (with certain properties) in a proof system 
( A) A°° that is defined below. Recall that ^ rog + -reduction graphs in Reg + do not have tree 
shape in general, but that their tree unfoldings can be captured by completed derivations in 
the proof system Reg + '°° from Section [3] 

► Definition 27 (proof system (A) A°°). The proof system (A) A°° acts on CRS-terms 



over the signature £(a) os a as formulas, is a Hilbert-style system for finite or infinite prooftrees 
(of depth <lo), and it has the axioms and rules displayed in Fig. [5J 
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A derivation T in (A) A°° is called (S)-eager if no conclusion of an instance of (@) or (A) 
in T> is the source of a -»-s-step. A finite or infinite derivation T in (A) pos A°° is called closed 
if all terms in leafs of T are axioms. By derivability of a term (AS)? T in (A) pos A°°, which 
is denoted symbolically by i~(a) + a~ (^)l T, we mean the existence of a closed derivation 
with conclusion (Ax)? T. 

► Proposition 28. Let A-term T e Ter(X°°) be an infinite A-term. Then there is a unique 
(S)-eager, and closed derivation T with conclusion ()T in (A)p OS A°°. T corresponds directly 
to the -*- reg + -derivation graph of ()T in Reg pos , which has the form of a tree. 

► Proposition 29. The proof system (A) Pog A°° is sound and complete for all position-anno- 
tated prefixed infinite A-terms in Ter((X) S X°°), that is: for all terms W e Ter(Y,(x) as \) 
it holds that *-(\\+ a~ W if and only if W e Ter((X) X°°) (thus W is of the form 
(Xxi • . . x n ) q pu Pi W' for n c N, pi, . . . ,p n ,g e {0, 1}*, and (Xxi ...x n )W'e Ter((X)X")). 

The following proposition relates positions q in an infinite A-term T with -> reg + -rewrite 
sequences from ()i)T in Reg pos that 'access' g in T, and that in doing so eventually produce 
the 'position-annotated generated subterm of T at position q\ 

► Proposition 30. Let T e Ter(A°°) be an infinite A-term. Then for all q e {0, 1}* it holds: 

(i) If q € Pos(T) then there is a unique -^ reg + -rewrite sequence in Reg pos of the form 
()nT -» rcg + (Aaq . . . x n )~ „ C/ that proceeds via terms of the form (Aa^)?*. E/j where 
the qi are contained in, and exhaust, the set {q' \ q' < q} such that furthermore: 

(a) U corresponds to the remaining body of the A-term T at and below q , with the 
variables free in U being bound in the abstraction prefix of (Xxi . . ,i„)' U . 

(b) the position-annotated generated subterm {\x\ . . . Xn) q U of T contains the 
information on at which A-binding positions of T the free variables x\, . . . ,x n of 
U have been bound originally in T, namely: a free occurrence of Xi in U, where 
1 < i < n, descends from a variable position below q in T that is bound by a 
A-binding at position pi above q in T. 

(ii) If (Yi\T -» rcg + (Xxi . . . x n ) q U holds, then q e Pos(T) follows (and hence further 

statements described in item (m) hold as well). 

Proof (Hint). The two items of the proposition can be established by induction on the 
length of -»reg+ -rewrite sequences in Reg pos , and by induction on the length of positions, 
respectively. Thoroughly formal proofs of the statements here have to be based on the 
definition of iCRS-terms as a-equivalence classes of iCRS-preterms and the definition of 
positions in iCRS-preterms. ■* 

As an easy consequence, we obtain the following proposition. 

► Proposition 31. For all T e Ter(X°°) and positions q e Pos(T) it holds: 

(i) if q is the position of an abstraction in T, then there is a rewrite sequence in Reg* s of the 

form ()qT ^ rcg + (Xxi . . .x n ) piJln Xx n+ i.U for some n > and pi,... ,p n e Pos{T). 
(ii) if q is the position of an application in T, then there is a rewrite sequence in Reg + pos of 
the form Q\\T -» rog + (Axi . . . x n ) pi Uq U\ for some n > and pi, . . . ,p n e Pos(T). 

(iii) if q is a variable position in T, then there is a rewrite sequence in Reg pos of the form 
0\)T ^ro g + (Axi .. .x n )p lt pn X n for some n> 1 and pi,... ,p n e Pos(T). 

The next proposition describes the connection between the concepts of binding and 
capturing with position-annotated ->- rcg +-rewrite sequences. 
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► Proposition 32. For all T e Ter(A°°) and p, q e {0, 1}* it holds: 

po—q <=>• there is a rewrite sequence ()«T -» re g+ (Axi .. .!„)' x„ with p = p„ 

p *- q <=> there is a rewrite sequence 

()()T ^ro g + (Axi . . . X; . . . x n )l"™ >pi! pn C/ -» reg+ (Axi . . . x l ) q pi ^ pt x l 
such that z < n, and p = p n 

Proof (Hint). The two statements of this proposition can be established along the statements 
of Proposition |30j < 

► Lemma 33. If for some infinite X-term T we have ()(,T -» reg + (Axi . . -x n ) p U with 
Xi , . . . , x„ distinct and i e {1, . . . , n} such that Xi occurs free in the body U , then there exists 
</e{0, 1}* such that (Axi . . -^n)^, „. tPn U -»reg+ (Axi . . -Xj)^ ,„ tPi Xi and q' > q. 

Proof (Hint). The statement of the lemma can again be proved along the statement of 
Proposition [30] ■* 

The lemma below describes the connection between binding-capturing chains and posi- 
tion-annotated -> rcg +-rewrite sequences. 

► Lemma 34 (binding-capturing chains). For all T e Ter°°(A) it holds: 

(i) If () £ T -» re g+ (Axi . . -Xn)* p? Uj then pi, . . . ,p n e Pos(T), and if n > 2, there are 

q 2 ,...,q n e Pos(T) such that p ± o- q 2 -*■ p 2 °- ■ ■ ■ <— q n -* Pn- 
(ii) If pi °— q 2 -* Pi °— ■ ■ ■ °— q n -* Pn is a binding-capturing chain in T, then there exist 

positions n, . . . , r m e Pos{T) with m> n such that Q/\T -» rcg + (Axi . . . x rn ) r m r U , 

pi,...,p n e {ri,...,r m }, andpi < p 2 < . . . < p n = r m . 



Proof. We first prove statement dTb , by induction on n e N. For n = nothing has to be 
shown. In case of n = 1, for a given rewrite sequence () e T -» rcg + (Xxi) p U it has to be shown 
that p\ € Pos(T). This follows from Lemma 25 (p}. For the induction step from nton+1 we 



let n > 1, and assume a rewrite sequence of the form r : () T -»- re g+ (Axi . . . X n+ i)p U 



By Lemma 25 (livj) , r is of the form: 

r: () e T^ rcg+ (Axi . . .Xn)l"^, yPri Xxn+i-V 

^rc g + (AXi . . . x n%n+l)p u ... t p niPn+1 V 
-»reg+ (AXl . . . X„X„ + i)^ U 



(1) 



By applying the induction hypothesis to the initial segment of r formed by the rewrite steps 
in the first line of (HI), it follows that pi . . .p n e Pos(T), and that there exists a binding-captu- 
ring chain p\ °— q 2 -* p 2 o— . . . »- q n -* p n in T, for some positions q 2 , . . . , q n e Pos(T) (if n = 1, 
this binding-capturing chain has length 0). Now note that x n must occurs free in \x n+ \.V 
and in V, because otherwise the -> rog +-step displayed in (flj), which is a -+,\-step, would not be 



g -eager. Then Lemma 
holds for some q n +i > p n +i 
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implies that (Axi . . . x„+i)^ ° Pn+i V -» Ieg + (Axi . . . x n ) q pi + ^ pn X n 
00. Together with it follows:" 



() e T -^ rcg+ (Axi . •.x n+ i)^°° n+i F^ reg + (Axi . ■■Xn) 9 p T,!..,p n Xn 



From this we obtain that q n +i,p n +i € Pos(T) holds by Lemma 25 (u| , and that p n °— q n +i 
and q n+ i -* p n +\ hold by Proposition [32] With these links the already obtained binding-cap- 
turing chain can be extended to p\ °— q 2 -* p 2 °— . . . -* p n °— q n +i ~* Pn+i- Also, we have 
seen that all positions in this binding-capturing chain are in Pos{T). In this way we have 
established the induction step. 
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Second, we prove statement (m} of the lemma by induction on n, the number of binder 
positions in the assumed binding-capturing chain. 

In the base case n = nothing needs to be shown. For showing the case that n = 1, let 



Pi be a binder position in T. Then by using Proposition 31 ail, and a -> rog +-step over a 
A-abstraction we obtain a rewrite sequence of the form: 

()()T ^rcg+ (Axi .. .X m -i)ll ,...,r m ^ x "i-U ^ rcg + (Xx x . . . X m -lX m )l,^ __ .,r m -i,r m U 

for some m > 1 and T\,.. . ,r m € Pos(T) with r m = p±. This shows the statement for n = 1. 

For the induction step, we let n > 1 and p\ °— q 2 -* P2 °— ■ ■ ■ -*■ p n °— <ln+i -* Pn+i be a 
binding-capturing chain in T. By applying the induction hypothesis to the binding-capturing 
chain formed by all but the last two links, we obtain n, . . . , r mo , s e Pos(T) with too £ n and 
a rewrite sequence: 

() e T -» reg+ (Aa:i . . . x mo )^°°° mo t/ 

such that pi, . . . ,p n e {n, . . . , r mo } and pi < p 2 < . . . < p„ = r mo 



Now since p„ °— Q n +i holds, it follows by Proposition 32 that () T -» rcg + (Xyi ■ ■ ■ yk) s n+1 s Vk 



for some k > 1 and si, . . . , Sfe such that Sk = p n = r m - By Lemma 25 (jivl) , it follows: 
() e T^ rcg+ (Ayi...y fc )^ TO Sfc t/o^ reg+ (Aj/i ...j/&)^. ]Sk t/ fe 



with some [//,. Then Lemma 



\9n+l 



25 (|ii|, entails (Ayr ...J/jfe)** ,..., Sfc #o = ( Aa; i ■ ■ ■ ^o)I™°.., r „ ^o, 



(Ayi . . . yfc) " +1 Sl yk = (Axi . . . x mo ) " +1 x mo , and hence k = m . Thus we obtain: 

l 3***) ft !)•••) m.Q 

O'T ^rcg+ (Axi . . . a;mo)r™° ,r mo ^0 ^rcg+ (A*l . . . Imo)^.!,^ X ™o (3) 

Due to the last link q„ + i -* p„+i in the assumed binding-capturing chain there exists, in view of 
Proposition 



32 



SiOO T r /, \<3ra+l 



a rewrite sequence () T ^ rcg+ (A^i •••Zl) au ... >ai V -» reg + (Azi • ••^o)^--,^ Z; ° 
for some si, . . . , s; with si = p n +i, and 1 < Iq < I. By Lemma [25l (Ih]), it follows from this 



■ '■a 



rewrite sequence and the one in d3| that (Xz\ . . . zi ) s " +1 g zi = (Xx\ . . . x mo )l" 
l = to , and (Xz 1 ...zi) s ' V = (Xz x . . . zi) Sl V. Hence we obtain: 

() e T^ rcg+ (Az 1 ...Z;)ri°.°. , rm „,,, tl /^rogt ( AXl . . . X mQ )£ ** X mQ (4) 

with s; = p n+ i. Now we let m := I, r mo+ i := S mo+ i, . . . , n := S; (note that thus ri = s t = p n +i) 
and (Axi . . .1™)!™ [/ := (Azi . . . zj)*' _ ., , t/ to find that (lil) then yields: 

() £ T ^ rog+ (Axi . . . £m)r™...,r m ^ 

with pi,... ,p„+i e {n,. . . ,?"„,,} due to pi, . . . ,p„ e {n, . . .,r mo } and p„+i = s ; = r m . Since by 



Lemma 25 (m), r\ < r^ < ■ ■ ■ < r m holds, we have pi < P2 < . . . < p n = r mo by the induction 
hypothesis, and know too < to and r m = p n +i, we also obtain p\ < p 2 < ■ . ■ < p n < Pn+i = r m . 
In this way we have successfully performed the induction step. ■* 

The following lemma relates the length of binding-capturing chains in an infinite A-term T 
with the length of abstraction prefixes in generated subterms of T with respect to ^ rcg + , that 
is, the length of abstraction prefixes of terms that can be obtained from ()T by ^ rog +-rewrite 
sequences in the not position-annotated system Reg . 

*■ Lemma 35. Let T e Ter(X°°). For all n e N it holds: T contains a binding-capturing chain 
of length max {n - 1, 0} if and only if there is a rewrite sequence ()T -» reg + (Xxi . . . x n ) U 
(in Reg* ) for some (Xx\ . . . x n ) U e ST (T) . 
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Proof. Let T e Ter(A°°). For showing the direction "<=", we assume a rewrite sequence in 
Reg + of the form ()T -»reg + (A#i . . . x n ) U. By Proposition 26 (m), this rewrite sequence can 



be lifted to a rewrite sequence () T -» re g+ (Xx\ . . -x n ) U in Reg, pos . Then it follows 

from Lemma 34 (ul, that there exists a binding-capturing chain in T of length max {n - 1,0}. 



Now we show the direction "=>". For the case n = nothing has to be shown. Now 
we let n > 1, and suppose that p\ °— (72 ~* P2 ° — • • • ~* Pn is a binding-capturing chain in 
T of length n — 1. Then by an appeal to Lemma |34[ (m}, we obtain a rewrite sequence 
r in Reg + pos of the form () T -» rag + (Aa;i . . . x m ) r ™ r [/ for positions ri, . . . ,r m e Pos(T) 
with m > n such that pi, . . . ,p n € { r ii • ■ ■ , r m}, and pi < P2 < ■ ■ ■ < p n = r m . By possibly 
repeated application of Lemma 25 (uiik, we obtain that r is actually of the form QnT -» rog + 

(\xi . ..x n ) r " J7o -»-reg+ (Axi . ..«m)r7... r ^- B y a PPly m S Proposition! 



20 



to the 

first seqment of r displayed here we obtain the -> rcg +-rewrite sequence ()T -» reg + (Axi . . . x n ) Uq 
in Reg by projection through just dropping the position annotations. By the definition of 
ST + (T) (see Definition [7} , (Aon ...*„) I7 6 ST + (T) follows. -« 

The lemma below states a condition that guarantees an infinite binding-capturing chain 
in an infinite A-term: the existence of an infinite -^ rcg +-rewrite sequence in which the length 
of the abstraction prefixes tends to infinity in the limit. 

► Lemma 36 (infinite binding-capturing chains). Let T be a X-term, and let t be an infinite 
-^rcg+ -rewrite sequence ()T = (Xx )T -^ re g+ (Axi)Ti -^ re g+ •■• such that lim^oo |5j| = 00. 
Then there exists an infinite binding-capturing chain in T. 



Proof. Let T and r be as in the assumption of the lemma. By Proposition 20 (i), r can be 
lifted to a rewrite sequence with position annotations: 

T pos . qcji _ (^Xx ) e T a -^ rcg+ (XxiY^Ti -> reg + . . . -^ rcg+ (XxiY^.Ti -^ rcg + . . . 

where, for all i € N, qi are positions and pi = (pi, . . . ,p mi ) vectors of positions, with TOj e N. 
Due to linij^oo |5j| = 00 it follows that liminf j_»oo |xj| = 00, and hence there exists a sequence 
{ij}j«N °f increasing = Iq < i\ < ii < . . . natural numbers such that = \xi \ < \x'i 1 \ < \xi 2 \ < . . ., 
and \xi- < \xk\ for all j,k eN with k > ij. Now if the rewrite sequence r pos is written with 
highlighted segments of the form: 

rP° s : (YT=(Xx l0 )llT l0 ^ cg+ ... 

«, . «, +1 ( 5 ) 

(where j e N), then it follows, for all j 6 N, that \pi\ < \pi +1 |, and that all terms of the 
sequence after (Xxi)J 3 J*, have an abstraction prefix of length greater or equal \xi\. Due to 
the property of steps in Reg* s to remove position annotations only when the corresponding 
abstraction variable is dropped from the prefix in an ->-s-step, it follows that pi- <pi +1 holds 
in the prefix order, for all j e N, and hence that pi < pi t < pi 2 < . . .. 

Consequently, the position vectors pi. tend towards an infinite vector (j , i,r2,r2,---) of 
positions such that there are ni < ri2 < . • • in N with pi. = (ri,r2,---,r n .) for all jeN with 
j > 0. But then induction on the segmented structure of r pos as indicated in (pi), thereby 



using Lemma 34 (HI), establishes the existence of positions si, S2, ■ ■ ■ such that: 
n <^ si -» r 2 <^ . . . -» r nj . ^ s nj . +1 -* r„ j+1 o- s„ j+2 -*...»- s n . +1 -* r„. +1 
holds, thereby yielding an infinite binding capturing chain in T. 
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Now we formulate and prove the main theorem of this section, which applies the concept 
of binding-capturing chain to pin down, among all infinite A-terms that are regular, those 
that are strongly regular. 

► Theorem 37. A regular X-term is strongly regular if and only if it contains only finite 
binding- capturing chains. 

Proof. Let T be an infinite A-tcrm that is regular. 

For showing "=>", suppose that T is also strongly regular. Then by the definition of 
strong regularity, ST + (T) is finite. Let n be the length of the longest abstraction prefix of a 



term in ST (T). Then Lemma 35 implies that the length of every binding-capturing chain 
in T is bounded by max{n- 1,0}. Hence T only contains finite binding-capturing chains. 

For the implication "•<=" we argue indirectly: assuming that T is not strongly regular, we 
show the existence of an infinite binding-capturing chain in T . 

So suppose that T is not strongly regular. Then ST + {T) is infinite, and so ()T has 
infinitely many ->- reg -reducts. Since the rewrite strategy -*- r eg+ has branching degree < 2 
(branching only happens at sources of -»<§; -steps), it follows by Konig's Lemma^jthat there 
is an infinite rewrite sequence: 

r : ()T = (Xx Q )T ^ rcg + (AJi)Ti -^ TOg + ■ ■ • "*reg+ (AXj)Tj -*- reg + . . . 



that passes through distinct terms. By Lemma 11 (m), this rewrite sequence projects to: 
f •■ ()T = (Xx' )T ^ rcg (Axi)Ti -^ rog . . . -^ rcg (Xx'AT -» reg . . . , 

so that for all j e N: (Xx d ) T } -» del (Xx'^Tj (6) 

thereby respectively shortening the length of the abstraction prefix. Since T is regular, 
ST(T) is finite, and hence only finitely many terms occur in f. Now we use this contrast 
with r together with ([6]) to show that the prefix lengths of terms in r tend to infinity. 

Suppose that lim^,*, \£i\ = oo does not hold. Then there exists Iq e N such that \$i\ < Iq 
for infinitely many i € N. Hence there is a sequence iq < i\ < «2 < *3 < ■ • ■ in N such that: 

A := {(Xx l3 ) T l3 | j e N} is infinite (7) 

(since the terms on r are distinct), and [xtA < Iq for all (Xxi j )Ti j e S. On the other hand: 

B := {{Xx'^ | j 6 N} c ST(T) is finite 

because T is regular. However, since every term in A has a -»dei-reduct in B due to (16b , as 
well as an abstraction prefix of a length bounded by Iq, it follows by Proposition[To) (El, that 
A also has to be finite, in contradiction with d7j|. Hence we conclude: lim^oo \$i\ = oo. 
Now Lemma [36] is applicable to r, and yields an infinite binding-capturing chain in T. ■« 

By adding the statement of Proposition [12] we obtain the following accentuation of this 
theorem. 

► Corollary 38. An infinite X-term is strongly regular if and only if it is regular, and contains 
only finite binding-capturing chains. 



4 Here we use the following version of Konig's Lemma: Let G be a rooted directed graph with root r. 
Suppose that G has infinitely many vertices, that every vertex of G is reachable from r via a directed 
path, and that every vertex has finite out-degree (finitely many successor vertices). Then there exists 
an infinite directed path in G that starts at r and is simple (no repetitions of vertices). 



C. Grabmayer and J. Rochel 19 



5 Expressibility by terms of the A-calculus with // 

Having adapted (in Section El) the concept of regularity for infinite A-terms in two ways, we 
now obtain an expressibility result for one of these adaptations that is analogous to that in 
[5] for regular first-order trees with respect to rational expressions (or equivalently, /i-terms). 
We show that an infinite A-term is strongly regular if and only if it is A M -expressiblc. 
We first define terms of A M , the unfolding rewrite relation, and A^-expressibility. 

► Definition 39 (CRS-representation for A M ). The CRS-signature S^ = S^ u {mu} for A M 
extends £> by a unary function symbol mu. By Ter(X fJ/ ) we denote the set of closed finite 
CRS-terms over E^ with the restriction that CRS-abstraction occurs only as an argument 
of the symbols abs or mu. By Ter((A)A M ) we denote the analogously defined set of terms 
over the signature £(a)a u {mu}. We consider the (i-unfolding rule in informal and formal 
notation: 

(^): fix.M(x) ->■ M(fj,x.M(x)) ^ : mu([x]Z(x)) ->■ Z(mu([x]Z(x))) 

This rule induces the unfolding rewrite relation -^ on Ter(X^) and Ter((A)A /J ). We say 
that a A^-term M expresses an infinite A-term V if M -»»- M V holds, that is, M unfolds 
to V via a typically infinite, strongly convergent -^-rewrite sequence (similar for terms 
in Ter((A)A M )). And an infinite A-term T is A M - expressible if there is a A^-term M that 
expresses T. 

We sketch some intuition for the proof, which proceeds by a sequence of proof-theoretic 
transformations. We focus on the more difficult direction. Let T be a strongly regular infinite 
A-term. We want to extract a A^-term M that expresses T from the finite -> rcg + -reduction 
graph G of T. We first obtain a closed derivation T> of ()T in Regg. The derivation V can 
be viewed as a finite term graph that has G as its homomorphic image, and that does not 
exhibit horizontal sharing ([3J Sec. 4.3]). Such term graphs correspond directly to A M -terms 
(analogous to [3]). In order to extract the A^-term M corresponding to T> from this derivation, 
we annotate it inductively to a A^-term- annotated derivation T> with conclusion () M : T in a 
proof system Expr that is a variant of Regg . Then it remains to show that M indeed unfolds 
to T. For this we prove that T> unfolds to/gives rise to infinite derivations in the variant 
systems Expr°° and Unf °° , which witness infinite outermost rewrite sequences M ->» M T. 

The CRS consisting of the rule p M is orthogonal and fully-extended [TS] • As a consequence 
of the result in [TT] that outermost-fair strategies in orthogonal, fully extended iCRSs are 
normalizing, we obtain the following proposition. 

► Proposition 40. Let M e Ter(X fl ) and T € Ter(\°°). If M expresses T, then there is an 
outermost -^-rewrite sequence of length <uj that witnesses M -»^ T, and T is the unique 
A-term expressed by M. Analogously for prefixed terms in A M that express prefixed A-terms. 

Hence the infinite outermost unfolding rewrite relation -*»►' to infinite normal form defines a 
partial mapping from Ter(A M ) to Ter(A°°), and from Ter((A)A M ) to Ter((A)A°°). 

The relation -»>■' can be defined via derivability in the proof system Unf °° in Fig. 6 the 
existence of a possibly infinite derivation that is closed in the sense of Def. |13| and admissible, 
i.e. it is (S)-eager, and does not contain infinitely many consecutive instances of the rule (fi). 

► Proposition 41. Unf °° is sound and complete w.r.t. -»►' : For all (Xx) T e Ter(( A) A°°) and 
(Xx)MeTer((\)\^), H Unf ~ (Ax) M ^> (Xx) T holds if and only if (Xx)M °&l (Xx)T. 
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(Xx) M M- (Xx) T (Xx) Mi ^> (Xx) T x 



(Xxy)y^>(Xxy)y (Xx)M M 1 ^> (Xx)T T x 



(Xxy)M^>(Xxy)T (Xx)M^>(Xx)T 



(Xx) Xy.M ^> (Xx) Xy.T (Xxy) M ^> (Xxy) T 

(Xx)M(^f.M(f))^>(Xx)T 



S (if the binding Xy 
is vacuous 
in M and T) 



(Xx)f,f.M(f)^>(Xx)T 



Figure 6 Proof system Unf °° for completely unfolding of A M -terms into infinite A-terms. 

► Definition 42 (proof systems Expr, Expr°°, and Expr , Expr~). The natural-deduction- 
style proof system Expr has as its formulas abstraction-prefixed A M -terms annotated by 
infinite A-terms, and the rules in Fig. [71 The system Expr has abstraction-prefixed A^-terms 
as formulas, and its rules arise from Expr by dropping the A-terms. Derivability in these 
systems means the existence of a closed (no open assumptions), (S)-eager, finite derivation. 
The variant Expr°° of Expr arises by replacing the rule (FIX) with the rule (fj,) in Fig.pl 
Expr~ arises from Expr analogously. A derivation in cither of these systems is called 
admissible if it does not contain infinitely many consecutive instances of (/i), and if it is 



(S)-eager in the sense of Dcf. 15 Derivability in these systems means the existence of an 



admissible derivation that is closed in the sense of Def. [13] 

We first observe that derivations in Expr 00 are closely linked to derivations in Unf °° . 

► Lemma 43. hE xpr » (Xx) M : T holds if and only if i~u n f~ (Xx) M ==> (Xx)T holds, for 
all (Xx)M eTer((X)X fl ) and (Xx)T e Ter((A)A°°). 

Proof. Derivations in Expr°° and in Unf °° differ only in the notation used for their formulas. 
A formula (Ay) N : U in Expr 00 corresponds to the formula (Xy) N ==► (Xy)U in Unf°°. 
This correspondence extends to a correspondence between derivations of Expr°° and Unf °° , 
which preserves and reflects the property of derivations to be closed and admissible. ■* 

The lemma below gathers basic properties of the proof systems Expr and Expr°° . 

► Lemma 44. (i) For every X^-term M: i-gxpr ()M if and only if i-gxpr ()M. 
(ii) Every closed derivation in Expr°° contains only finitely many X^-terms. 

(Hi) For every X^-term M it holds: i-gxpr ()M if and only if there is no -> rog + -generated 
subterm of M (in ST (M)) of the form ()fJ.xo ■ ■ ■ x n .xo for n e N. 

Proof. For m, in order to show "=>■" let T> be a finite, closed, (S)-eager derivation in Expr 
with conclusion ()M. By 'unfolding' this derivation through a process in which in each step: 

m)ci] 1 
m) Cl ] 1 Vo{cl) 

a subdenvation . , ,,,,,.. (Xy)N(ci) 

. , , , , A)(Q) is 'unfolded into FIX J 

of a bottommost ,,-x Ar , N , , . ,. [(Xy)nf.N(f)] 

„ „^ r (\y)N(ci) a subdenvation , , , , 

instance of FIX y "' y FIX,/ T> (uf.N(f)) 



(Xy)yf.N(f) 



(Xy)N(fif.N(f)) 
(Xy)uf.N(f) 
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(Xxy) M : T 



(Xxy) y : y 



X 



(Ax) Xy.M : Xy.T 

(AS) M ■■ T 
(Xxy) M : T 



(Ax) M : T (Ax) Mx : T x 

(A5)M Mi:T Ti 

S (if the binding Ay is vacuous) 



[(Ax) cj : T] ; 

(Ax) M(q) : T (if |D | > 1, and \y\ > \x\ for all (Ay) N : U on threads 

(Ax) uf.M( f) '■ T ' from open assumptions ((AS) q : T) down) 

Figure 7 Natural-deduction style proof system Expr for expressibility of infinite A-terms by 
A^-terms. The proof system Expr for Ajj-terms that express infinite A-terms arises by dropping the 
colons ':' and the subsequent infinite A-terms. Derivations in Expr and Expr must be (S)-eager. 



(Ax) M(/z/.M(/)) : T 
(Ax) fif.M(f) : T 



Figure 8 The proof system Expr°° for expressibility of A-terms by A M -terms arises from Expr by 
replacing the rule FIX with the rule /i. The proof system Expr°° for A M -terms that express A-terms 
arises from Expr°° by dropping the colons ':' and the subsequent infinite A-terms. Admissible 
derivations in Expr" 



and in Expr°° do not have infinitely many consecutive instances of ji. 



in the limit a closed derivation T in Expr~ is obtained with the same conclusion as V. 
Furthermore, T does not contain infinitely many consecutive instances of fj,, since the side- 



condition on (FIX) guarantees a guardedness condition analogous to Proposition 18 Hence T 
is a closed admissible derivation in Expr~ with conclusion ()M. For showing "<=", suppose 
that T is a closed, admissible derivation in Expr~ with conclusion ()M. Then there is a 
finite closed derivation T> with the same conclusion in the variant system Expr that does 
not require the side-condition part |X>o| ^ 1 for instances of (FIX). Via the process described 
above, V unfolds to a closed, (S)-eager derivation in Expr~, which has to be equal to T, 
since closed (S)-eager derivations in Expr~ are unique (due to the rules of this system). If T> 
would not satisfy the guardedness condition described in Proposition|18| and therefore would 
also violate the mentioned side-condition part, for any of its (FlX)-instances, then T would 
not be admissible. It follows that V is a closed derivation in Expr with conclusion ()M. 

For (jiih note that by the argument for "<=" in (|ij), every closed derivation in Expr~ is the 
unfolding of a closed derivation in Expr , and that the unfolding process can produce only 
finitely many A^-terms. Statement (fiiib follows by an easy analysis of closed derivations in 



Expr _ that violate the guardedness condition in Proposition 18 on any of its (FlX)-instances 



The lemma below links derivability in Expr with derivability in Expr" 
establishes this link via 'unfolding' and 'folding' of derivations. 



Its proof 



► Lemma 45. i-gxpr (Ax) M : T holds if and only if 
(Ax)Af eTer((A)A M ) and (Ax)T eTer((A)A~). 



Expr" (Ax) M : T holds, for all 



Proof. For "=>■" let T> be a closed, (S)-eager, and finite derivation in Expr with the conclusion 
(Ax) M:T. By an unfolding process and arguments analogous as described in the proof 
of Lemma 44 (m), V unfolds to a closed, admissible derivation T in Expr°° with the same 
conclusion (Ax) M:T 

For "<=", suppose that T is a closed, admissible derivation in Expr°° with conclusion 
(Ax) M : T. By changing the notation of the formulas used in T to the notation for formulas 
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in Unf °° as explained in the proof of Lemma 43 a closed, admissible derivation T' in Unf 



with conclusion (Xx)M => (Xx)T is obtained. Since subderivations of closed admissible 
derivations in Unf °° are again such derivations, it follows from the soundness of Unf °° with 



respect to -*»►' (cf. Proposition 41 1, and from the uniqueness of the infinite unfolding (if 



it exists) of a A^-term (cf . Proposition 40 1 that T 1 does not contain more infinite prefixed 



A-terms than prefixed A^-terms. By dropping the symbols => and the infinite A-terms on 
the right in T' (or by dropping the colons ':' and the infinite A-terms on the right in T) a 



closed admissible derivation 7^ in Expr is obtained. Due to Lemma 44 rtiil, 7^ contains 
only finitely many prefixed A^-terms. Due to the construction of 7^ this holds for 7" and T' 
as well. Since T' does not contain more infinite A-terms than A M -terms, it follows that T 1 
contains only finitely many formulas. Due to the correspondence between T 1 and 7", this 
holds also for 7". 

Therefore 7" can be 'folded', similar as in the proof of Theorem [20] into a finite closed 
derivation V in Expr with conclusion (AS) M : T by introducing (FlX)-instances to cut 
off the derivation above the upper occurrence of a repetition (the side-condition on such 
instances of (FIX) is guaranteed due to the admissibility of 7") . < 

By gathering properties of the systems Expr, Expr°°, and Unf°° that we have shown 
we can now justify the name of the system Expr on the grounds that this system formalizes 
the property of (prefixed) A^-terms to express (prefixed) infinite A-terms. 

► Theorem 46. The proof system Expr is sound and complete with respect to X^-expressib- 
ility. That is, for all expressions (AS) M:T of X^-term- annotated, prefixed infinite X-terms 
it holds that i-E xpr (AS) M:T if and only if (Ax) M expresses (Xx)T. 

Proof. By chaining the equivalences stated by Proposition [44] Lemma|43] and Lemma|45] •« 

The following lemma establishes the correspondence between derivability in the proof 
system Reg^ from Sections] and derivability in Expr. The crucial part of the proof consists 
in the constructive extraction, from a closed derivation T> in Reg^ with conclusion (AS)T, of 
a A M -term (Xx) M that describes the form of the derivation T>, and (as the results gathered 
in this section will show) also unfolds to (Xx)T. 

► Lemma 47. For all infinite prefixed X-terms (Xx)T , i- Reg + (AS)T holds if and only if 
there exists a prefixed X^-term (Xx) M such that i-gxpr (AS) M : T holds. 

Proof. For the implication "=>" it suffices to show that every derivation V in RegQ with 
conclusion (Xy) U and possibly with open assumptions can be transformed, by adding 
appropriate annotating A M -terms in the formulas of T>, into a derivation T> in Expr with 
conclusion (Xy) N : U and corresponding (if any) open assumptions, and such that the same 
variables in y = (j/i, . . . ,y n ) occur free in TV as in U. This can be established by induction on 
the depth \D\ of T>. In the base case, axioms (0) of Rego are annotated to axioms (0) of 
Expr, and marked assumptions ((Xz)V) 1 in Rego to marked assumptions ((Xz) c; : V) 1 . 

In the induction step it has to be shown that a derivation T> in Rego with immediate 
subderivation T>o can be annotated appropriately to a derivation T) in Expr, making use 
of the induction hypothesis that guarantees an annotated version T>q of T>$. For obtaining 
T> from 2?o the fact is used that the rules in Expr uniquely determine the A^-term in the 
conclusion of an instance, once the A /i -term(s) in the premise(s) (and in the case of (FIX) 
additionally the constants in the assumptions that are discharged) are given. In order to 
establish that instances of (S) in V give rise to corresponding instances of (S) in T>, the 
part of the induction hypothesis is used that guarantees that the A^-term annotation in the 
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premise of the rule contains precisely the same variable bindings from the abstraction prefix 
as the A-term it annotates. 

For showing "<=", let T> be a closed derivation in Expr with conclusion (Xx) L : T. Then 
a closed derivation T> in Regg with conclusion (Xx)T can be obtained by simply dropping 
the annotating A M -terms in formulas of T>. * 



► Example 48. The derivation T>i in Regg from Example 19 W, on the left can be annotated, 
as described by Lemma 47 to obtain the following derivation T>i in Expr : 



(Oq:T)' 
(Ax) q : T 
(Xxy) C; : T 



(Xxy) y ■■ y 



(Xx) 







(Xxy) Ciy.Ty 



(Xxy) x : x 



(Xxy) ciyx : Tyx 

(Ax) Xy.ciyx : Tyx 

() Xxy.Qyx : Xxy.Tyx 



><i 



X 



A 
FIX,m 



()fj,f.Xxy.fyx: T 
Note that the A^-term in the conclusion unfolds to T, the infinite A-term in Fig. IT] 

Now we can prove our main result on A^-expressibility by composing the proof-theoretic 
transformations developed in this section, and by applying the characterization, from Section|3] 
of strong regularity of infinite A-terms via derivability in Regg . 

► Theorem 49. An infinite X-term is X^-expressible if and only if it is strongly regular. 

Proof. For all infinite A-terms T it holds: 



T is A M -expressiblc 



3MeTer(A M ). M ^T 



3MeTer(AA M°&\T 



(by the definition of 

A M -expressibility) 



("= 



by Proposition[40| 
due to -»»' £ ->» M ) 



unf 



(by Lemma 43 ) 



3M 6 Ter(A^). H Unf ~ ()M =^> ()T (by Proposition|5T I 
3MeTer(A M ). H Expr - () M : T 
3AfeTer(A M ). H Expr () M : T 

T is strongly regular 



(by Lemma 45 ) 



(by Lemma 47) 

(by Theorcm[20|(|u]), 



which establishes the statement of the theorem. ■* 

From Theorem |49| and Corollary |38| we obtain a theorem that condenses our main results. 

► Theorem 50. For all infinite X-terms T the following statements are equivalent: 

(i) T is X^-expressible. 
(ii) T is strongly regular. 
(Hi) T is regular, and it only contains finite binding-capturing chains. 

6 Generalization to A| etrec and practical perspectives 

In [S] we undertook an in-depth study of expressibility in A| e t re c? an d obtained the more 
general, but analogous result for full A| etrec instead of only for A M . While there are significantly 
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more technicalities involved, the structure of the proofs is analogous to here. Instead of 
demanding eager application of the scope-delimiting rules g and g , respectively, there we 
study A-term decomposition ->-f eg and ->f- cg + for arbitrary scope-delimiting strategies S. 

Concepts introduced here and in [5] have the potential to be practically relevant for the 
implementation of functional programming languages. In |10j we study various higher-order 
and first-order term-graph representations of cyclic A-terms. Their definitions draw heavily 
on the decomposition rewrite systems in this paper. That is, every term in A| etrec can be 
translated into a finite first-order 'A-term-graph' by applying the rewrite strategy -> rog + to 
the expressed strongly regular, infinite A-term. Thereby vertices with the labels A, @, S are 
created according to the kind of -*- reg +-step observed (plus variable occurrence vertices with 
label 0). The degree of sharing exhibited by A-term-graphs can be analyzed with functional 
bisimulation. In |10j we identify a class of first-order representations with eager application 
of scope closure that faithfully preserves and reflects the sharing order on higher-order term 
graphs. This leads to an algorithm for efficiently determining the maximally shared form of a 
term in A| etrec , which can be put to use in a compiler as part of an optimizing transformation. 

Another aspect is that functional programming languages based on the A-calculus with 
letrec restrict the set of (in the unfolding semantics) expressible terms to the strongly regular 
infinite A-terms. But members of the superclass of regular terms are also finitely expressible 
via sets of equations or CRS- rules. Therefore the question arises whether finite representations 
of regular terms afford new opportunities in compiling functional programming languages. 
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